From Neural Sentence Summarization to Headline Generation: A Coarse-to-Fine Approach
نویسندگان
چکیده
Headline generation is a task of abstractive text summarization, and previously suffers from the immaturity of natural language generation techniques. Recent success of neural sentence summarization models shows the capacity of generating informative, fluent headlines conditioned on selected recapitulative sentences. In this paper, we investigate the extension of sentence summarization models to the document headline generation task. The challenge is that extending the sentence summarization model to consider more document information will mostly confuse the model and hurt the performance. In this paper, we propose a coarse-to-fine approach, which first identifies the important sentences of a document using document summarization techniques, and then exploits a multi-sentence summarization model with hierarchical attention to leverage the important sentences for headline generation. Experimental results on a large real dataset demonstrate the proposed approach significantly improves the performance of neural sentence summarization models on the headline generation task.
منابع مشابه
Multiple Alternative Sentence Compressions as a Tool for Automatic Summarization Tasks
Title of dissertation: MULTIPLE ALTERNATIVE SENTENCE COMPRESSIONS AS A TOOL FOR AUTOMATIC SUMMARIZATION TASKS David M. Zajic Doctor of Philosophy, 2007 Dissertation directed by: Professor Bonnie J. Dorr, advisor Professor Jimmy Lin, co-advisor Department of Computer Science Automatic summarization is the distillation of important information from a source into an abridged form for a particular ...
متن کاملHeadline Generation Based on Statistical Translation
Extractive summarization techniques cannot generate document summaries shorter than a single sentence, something that is often required. An ideal summarization system would understand each document and generate an appropriate summary directly from the results of that understanding. A more practical approach to this problem results in the use of an approximation: viewing summarization as a probl...
متن کاملHeadline extraction based on a combination of uni- and multidocument summarization techniques
The TNO system for multi-document summarisation is based on an extraction approach. For headline generation, we chose to extend our system to extract the most informative topical noun phrase. The cluster topic is defined as the most frequent term occurring in the most salient document sentences. The core of our system is a probabilistic model, which estimates the log-odds of salience based on a...
متن کاملConceptual Multi-layer Neural Network Model for Headline Generation
Neural attention-based models have been widely used recently in headline generation by mapping source document to target headline. However, the traditional neural headline generation models utilize the first sentence of the document as the training input while ignoring the impact of the document concept information on headline generation. In this work, A new neural attention-based model called ...
متن کاملTemplate-Filtered Headline Summarization
Headline summarization is a difficult task because it requires maximizing text content in short summary length while maintaining grammaticality. This paper describes our first attempt toward solving this problem with a system that generates key headline clusters and fine-tunes them using templates.
متن کامل